g
[
g,
,
]
R function for the random forest algorithm is in the
Forest
package.
The
R
function
is
also
called
Forest. The random forest algorithm can be used for both
tion and regression analysis. A random forest model is
ed using the following code,
randomForest(formula,data,⋯)
e 3.49(a) shows the ROC curve of a random forest model for the
ncer data.
andom forest algorithm can also analyse non-numerical data. This
e it is developed based on decision tree algorithms. The factor Xa
data was used for the demonstration. Figure 3.49(b) shows the
ve for this random forest model constructed for the data, where
ing process was used to encode the peptides.
dition to randomForest, the other package named as party
be used for random forest data analysis. The function is named as
t. The format of using this package for constructing a random
del using party is shown below,
cforest (formula,data,control)
e 3.49(c) shows the ROC curve of a random forest model
ed using the party package for the factor Xa protease cleavage
ain, no encoding process was used for the amino acids in the
dvantage of the party package is that it can generate a tree for
ation and visualisation. For instance, Figure 3.50 shows a tree
d by the party package for the breast cancer data set. Using this
possible to find which variables play the significant roles in the
mour classification problem, hence providing useful knowledge
t cancer diagnosis. The tree also shows the significance of each
i.e., a p value associated with a variable. For instance, the
X24 had the least p value and hence was the most significant
for breast cancer diagnosis based on a model constructed for this